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Two new theoretical models for representing random traffic in connecting 
networks are presented. The first is called the "thermodynamic" model and 
is studied in detail. The second model is formulated in an effort to take 
methods of routing into account and to meet certain drawbacks of the "ther- 
modynamic" model in describing customer behavior; since it is more realistic 
than the first, it leads to results that are vastly more complicated and must be 
described in another paper. 

The "thermodynamic" model is worth considering for four reasons: 
(/) // is faithful to the structure of real connecting systems. Indeed it is 
an improvement over many previous models in that it only considers physi- 
cally accessible states of the connecting network, while the latter suffer the 
drawback that a large fraction of the network stales on which calculation is 
based arc physically meaningless, being unreachal)le under normal opera- 
lion. 

(2) It gives rise to a relatively simple theory in which explicit calculations 
are possible. 

(3) The " thermodynamic" model provides a good simple description of 
traffic in the interior of a large communications network. 

(4) It has an analogy to statistical mechanics which permits us to be 
guided by the latter theory as we try to use the model to understand- the 
properties of large-scale connecting systems. 

The two models to be described differ in only one respect. In the first (the 
"thermodynamic") model, the system moves from a stale x to a stale y thai 
has one more call in progress, at a rale X; the effective calling-rate per idle 
inlet-outlet pair is thus proportional to the number of paths usable in x from 
that inlet to that outlet. In the second model, the calling-rate per idle inlet- 
outlet pair is set at X, and is then spread over the paths usable in x from that 
inlet to that outlet in accordance with some routing rule. This provides a 
mathematical description of routing, and avoids the unwelcome feature that 
a single customer's calling-rate depends on the stale of the network. 
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The "thermodynamic" model is based on the single postulate that the 
''equilibrium" probabilities of the states of the connecting network maximize 
the entropy functional for a fixed value of the traffic carried. These probabili- 
ties have the same geometric or exponential form, as the canonical Maxwell- 
Bollzmann distribution of statistical mechanics. The theory developed applies 
to any connecting network regardless of its structure or configuration. The 
number of calls in progress is analogous to the energy of a physical system. 
As in statistical mechanics, important averages can be expressed as loga- 
rithmic derivatives of a generating function analogous to the partition func- 
tion of physics. 

It is possible to give an interpretation of the maximum entropy postulate 
in terms of random behavior at the inlets and outlets of the connecting net- 
work; this interpretation leads to a stochastic progress z t of the familiar 
Markov type, for which the canonical distribution is invariant. The transi- 
tion rate matrix of z t is self-adjoint in a suitable inner product space, so 
that the approach of z t to equilibrium is easily studied, with resulting appli- 
cations to traffic measurement. 
i 

. INTRODUCTION 

Like the physicist., the traffic engineer is faced with the study of an 
extremely complex system which is best described in statistical terms. 
The great success of the theoretical methods of statistical physics has 
given rise to a fervent hope, sometimes voiced among traffic theorists, 
that similar methods exist and can be found for the study of congestion. 
Indeed, the problems are much the same: one desires a small amount of 
"macroscopic" information about averages, based in a rational way on 
vast complexities of detail. A. K. Frlang was probably influenced by 
statistical mechanics when he introduced his method of "statistical 
equilibrium" into traffic theory. This method has had great success in 
dealing with problems of the birth-and-death type, like trunking and 
quoueing, but as applied to more complex cases it has led mostly to alge- 
braic and combinatory difficulties. Nothing as elegant or powerful as 
statistical mechanics has resulted so far. 

We shall present two traffic models in this paper. The first is the out- 
come of a deliberate attempt to ape the methods of physicists in statis- 
tical mechanics, and thus to realize, at least in part, the hope mentioned 
above. It is called the "thermodynamic" model, and it is treated in de- 
tail. The second model is introduced later in the paper in an attempt to 
avoid certain drawbacks that appear in the interpretation of the "ther- 
modynamic" model. Since it has independent interest and leads to in- 
volved, more realistic results, it is studied in detail in a later paper. 
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The approach taken in the "thermodynamic" model bears a close 
analogy to the methods of statistical mechanics, and is based on the 
single postulate that the "equilibrium" probabilities of the states of the 
connecting network maximize the entropy functional for a fixed value of 
the traffic carried. We develop a theory, briefly summarized in the next 
paragraph, by deducing the consequences and interpreting the meaning 
of the one basic assumption. 

The state probabilities that maximize the entropy for a given carried 
load form a distribution function over the set of states that has the same 
geometric or exponential form as the canonical (or Maxwell-Boltzmann) 
distribution of statistical mechanics. The theory applies to any connect- 
ing network, regardless of its structure or configuration. The number of 
calls in progress is analogous to the energy of a physical system. As in 
statistical mechanics, important averages can be expressed as logarith- 
mic derivatives of a generating function analogous lo the partition func- 
tion of physics. It is possible to give an interpretation of the maximum 
cut ropy postulate in terms of random behavior at the inlets and outlets 
of the connecting network. This interpretation leads to a stochastic 
process z t of the familiar Markov type, and is such that any stochastic 
process based on it satisfies the maximum entropy postulate. The transi- 
tion rate, matrix .4 of z t is self-adjoint in a suitable inner-product space; 
its characteristic values are real and non-positive, and can be studied by 
classical variational methods. In terms of these characteristic values the 
approach of z, to equilibrium can be studied, with resulting applications 
to traffic measurements. It turns out that the covariance of any function 
of z t is strictly positive. The paper ends with a time-dependent or non- 
stationary generalization of the maximum entropy postulate that has 
close analogies with the statistical "derivation" of thermodynamics. 

II. PRELIMINARIES 

In order to give an adequate summary and discussion of our theory in 
Section in, it is necessary to present first its concepts, terminology, and 
notation. Virtually all the notions about to be discussed have appeared 
in earlier papers by the author 1 ■- so only a brief resume is given here. 

Let S be the set of possible (or permissible, or both) states of a connect- 
ing network, and let x, //, • • • be variables ranging over S. The elements 
of S are partially ordered by inclusion ^, where x ^ y means that x 
can be obtained from y by removing zero or more calls. Furthermore, 
the states x e »S can be arranged in an intuitive manner in the stale-dia- 
gram, the Hasse figure for the partial ordering ^ . This figure is con- 
structed by partitioning the states in rows according to the number of 
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calls in progress. The unique zero state (in which no calls are in progress) 
is placed at the bottom of the figure; above it comes the row of states 
with one call in progress; and so on. The figure is completed by drawing 
a graph with the states as nodes, and with lines between states (in adja- 
cent rows) that differ in exactly one call. In an earlier work 2 we made the 
assumption that in a given state at most one call could be in progress 
between a given inlet and outlet; it is convenient to discard this assump- 
tion here. 

If the connecting network under study is in a state x, it can move only 
to states which are neighbors of x, i.e., are obtainable from x by adding a 
new call or terminating a call in progress. It is useful to divide the neigh- 
bors of x into two sets A x and B x where 

A x = set of states immediately above x, i.e., accessible from x by add- 
ing a new call, 

B x = set of states immediately below x, i.e., accessible from x by a 
hangup. 

For any set X, the notation | X | is used to denote the number of ele- 
ments of X. The states x t S can be defined 2 as sets of chains on a graph, 
one chain for each call in progress. Hence it is natural to use | x | to 
mean the number of calls in progress in x. The fcth level L k is the set of 
all states with k calls in progress, i.e., 

U = [xeS: \x\ = k). 

III. SUMMARY AND DISCUSSION 

We start, in Section IV, with a brief informal discussion of what is 
meant, heuristically as well as precisely, by "equilibrium." 

In Section V we formulate and discuss the maximum entropy postulate, 
according to which a suitable "equilibrium" distribution \q x , x e S} of 
probability over .S' is obtained by choosing the probability vector q so 
as to maximize the entropy functional 

H(q) = -S ^logr/x 

xtS 

for a given value of the average number of calls in progress, i.e., for 

xeS 

Various heuristic arguments are adduced to support the prima facie 
reasonableness of this principle. In Section VI it is shown that the 
maximizing probability vector q is given by 
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X |x| 

q * = *Q0' XeS 

xtS 

and X is a constant determined uniquely by the equation 

m = \(d/dk) log<t>(X). 

Because of their close similarity to corresponding notions from sta- 
tistical mechanics, the vector q and the function $( • ) are henceforth 
called the canonical distribution and the partition function, respectively. 

In Section VII we have collected together various properties of the 
partition function, most of them based on the partial ordering ^ of S. 
Among these are expressions for $( • ) in terms of the Mobius function 
for ^, and in terms of several sets of "characteristic polynomials" 
associated with ^ and 8. 

The canonical distribution q is placed in a dynamic context in Section 
VIII. This is done by defining a Markov process z t (taking values on *S') 
for which q forms a stationary distribution. The transition rate matrix 
A (infinitesimal generator) of this process allows one to give interpre- 
tations of this dynamic context in terms of calling rates and mean 
holding-times. An informal description of the process z t is this: if it is in 
state x, it moves to a state y e A x at a rate X, and to a state y e B x at a 
rate set at unity by convention. 

A full discussion of the analogy between the "thermodynamic" 
theory of traffic and statistical mechanics is given in Section IX. For 
purposes of illustration, we mention that the number of calls in progress 
corresponds to the energy of a statistical mechanical system, and that 
the constant X is related to the calling rate and corresponds to the 
temperature (up to a monotone transformation). 

The reasonableness of z t as a description of an operating connecting 
network is discussed and criticized in detail in Section X. Two possible 
interpretations of the inlets and outlets are considered: in one, the 
inlets and outlets are the ultimate terminals of the system, beyond 
which there is no more switching equipment; in the other, the inlets 
and outlets are switching centers such as PBX's, frames, or individual 
crossbar switches, acting as sources of traffic for a network under study. 
In the first interpretation, there can be at most one call in progress on 
an inlet or an outlet; in the second, there may be several. 

Regardless of which interpretation of the inlets and outlets is adopted, 
the transition rate matrix A for z t must be interpreted as saying that 
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the calling rate between an inlet and an outlet in a given state x is 
proportional to the number of free paths that x provides between that 
inlet and outlet. This assumption is unacceptable for the interpretation 
of inlets and outlets as ultimate terminals; it is not entirely unreasonable 
if the inlets and outlets are local switching centers. 

Section XI is devoted to describing, as an alternative to z t , a Markov 
stochastic process on S based on the assumption that the calling rate 
between an idle inlet terminal and an idle outlet terminal is a constant 
X. This calling rate is then spread over the possible ways of realizing 
the call in question in the current state of the network in accordance 
with some method of routing. A mathematical description of such a 
method of choosing routes for calls is given. This description leads 
directly to a transition rate matrix Q for a process x t in which every 
idle inlet-outlet terminal pair has a calling rate X in every state. The 
possibility that z t may be a useful perturbation of x t is considered. 

In Section XII it is observed that the rate matrix A for z t is a self- 
adjoint operator in a suitable finite-dimensional inner product space. 
This implies that the characteristic values of A are real and nonpositive, 
and leads to bounds on the rate of approach of z t to equilibrium. These 
bounds can be applied to estimate the covariance of z t , and the sampling- 
error incurred in measuring carried loads by averaging z t , or discrete 
observations of z t . In particular it is shown that the dominant (i.e., 
that of smallest nonzero magnitude) characteristic value n of A satisfies 

-im/a-) ^ n < o, 

where m and a are (respectively) the mean and standard deviation of 
the load associated with the equilibrium probability vector q for z t , so 
that 

m = 52 I * I ff» » 

xiS 

<? = £ (I x I - ™)*Qx- 



In Section XIII we give a formula for the covariance of any process 
ft defined by applying a function /( • ) to z t , i.e., 

St = /(**)■ 

This covariance is always positive. Applications of this formula to traffic 
averages are described briefly in Section XIV. Finally, Section XV con- 
siders a time-dependent generalization of the variational principle on 
which the "thermodynamic" theory of traffic is based. 

We conclude this section with an appraisal of the "thermodynamic" 
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theory presented herein. This will take the form of a list of comments, 
first -pro, then con, and then a defense. 

1. There exist theories'' 4 for connecting networks in which it is as- 
sumed that the links of the system are busy or idle with a given probabil- 
ity, all independently of one another. It can be verified that an over- 
whelming fraction of the states of the system so considered are in fact 
not physically meaningful states that the system can reach under nor- 
mal operation. The theory presented here is based only on permitted, 
physically meaningful states, and so is not open to this serious objection. 

2. The theory provides a uniform method of treating any connecting 
network in that the calculation of equilibrium probabilities always re- 
duces to that of the partition function. In most other treatments the 
nature of the algebraic process of calculating probabilities depends heav- 
ily on a detailed account of the network configuration; in our theory it 
depends on the network only via the numbers | L\ |, | Lt |, • • • . 

3. The maximum entropy principle can be given a certain informal, a 
priori justification. It provides a "conservative, worst possible case" 
approach to problems and processes of fantastic complexity. This is be- 
cause it can be interpreted as enjoining that an "equilibrium" distribu- 
tion of probability for given carried traffic correspond to a condition of 
maximum ignorance of the actual state of the connecting network. 

4. The canonical distribution q that results from the maximum en- 
tropy postulate can be embedded in a dynamic model of traffic by de- 
fining a Markov process z, for which q is the invariant or stationary dis- 
tribution. This dynamic model is described by a transition rate matrix 
which is a self-adjoint operator, a fact which makes it possible to study 
the time-dependent behavior of z, in a simple approximate way, with 
applications to traffic measurement, for instance. 

">. A very serious drawback of the "thermodynamic" theory is that its 
natural interpretation in terms of calling rates appears to be unreason- 
able in most practical cases. For this reason it will probably remain an 
amusing curiosity, rather than become an engineering tool. 

0. The problem of calculating the partition function *(•) is, as in 
statistical mechanics, very difficult except in cases of unrealistic sim- 
plicity. Thus, even if its assumptions arc granted, our "thermodynamic" 
theory does not afford much progress in calculating quantities of inter- 
est. 

7. The theory can take into account only one of the many different 
possible methods of routing calls in operating networks. Thus it cannot 
help the designer choose among alternative methods. 

By way of defense against the objections just raised, these points can 
be made: 
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(i) Comment 5, that the interpretation of the "thermodynamic" 
theory in terms of calling rates is unreasonable, depends on a natural, 
but not necessarily valid or compelling, assignment of causes for new 
calls. 

(n) Although the calculation of *( ■ ) is hard, it is at least a definite 
combinatory problem solvable in principle by counting; thus at least 
part of the problem of obtaining state probabilities is disposed of. 

(in) It is doubtful whether routing methods make as much as an or- 
der of magnitude of difference in carried loads in large systems; hence it 
is reasonable to ignore them in a relatively crude theory such as the 
present one. (See however, Benes, Ref . 1 1 .) 

The theory presented in this paper should be judged by its success in 
practice as well as by its agreement with our preconceptions. I believe 
that in spite of the major failings mentioned, the theory musters interest 
enough to warrant its presentation to traffic engineers, if only because 
its concepts and results may prove useful in more realistic approaches. 

IV. EQUILIBRIUM 

Quantities that are of interest in the design of a connecting network, 
such as the average load carried, the variance of the load, or even the 
probability of blocking, can often be calculated from a knowledge of 
some "equilibrium" or "stationary" state probabilities \q x , .r e 8} for 
the network of interest. These probabilities are usually assumed or 
proven to be of "equilibrium" type in the sense that they have some 
physically reasonable in variance property. 

Since the concepts of stationarity and equilibrium can assume many 
precise forms of varying strength, it is important to consider briefly 
some of these senses. The strongest notion, of course, is that of strict 
stationarity of a stochastic process, defined by the condition that all the 
finite-dimensional distributions be independent of time, i.e., be transla- 
tion-invariant, A whole class of weaker notions can be obtained by re- 
quiring only that the distributions of dimension not greater than n be 
invariant, The notion of wide-sense stationarity, defined by the condi- 
tion that the covariance depend only on the difference of its arguments, 
is still another concept of stationarity, formulated for a moment rather 
than a distribution. Again, Markov processes are described as homogene- 
ous or stationary if their transition probability operators are time-in- 
variant, 

"Equilibrium" is a word that usually connotes a stable, quasi-static 
random behavior which is perhaps a condition of attraction for a process, 
in the sense that a process not in equilibrium tends toward it, Ergodic 
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Markov processes with denumerable state spaces are typical examples. 
It is to be remarked, though, that use of the word "equilibrium" usually 
implies a nondegenerate limiting behavior for a process y, under study 
as / — * » . Thus a time-homogeneous Markov process may not have a 
genuine "equilibrium" distribution because it in some sense "blows up," 
e.g., the process may take values on the integers and the probability 
mass may move out toward + oo , even though the transition probabili- 
ties arc time-independent, in such a case, clearly, no first-order distribu- 
tion can be assigned which is time-invariant. 

The analytical expression of "equilibrium" often takes the form of a 
statement to the effect that an operator has zero as a characteristic value. 
Perhaps the most familiar example of such a statement arises in the case 
of a Markov process in continuous time with a transition rate matrix A ; 
the equilibrium equation is Aq = 0, for a probability vector q.* 
This equation, together with its connections to semigroups, to Markov 
processes, and to the notion of statistical equilibrium used in traffic 
theory, is discussed immediately below. 

A traditional analytical method in telephone traffic theory is that of 
"statistical equilibrium," due to Erlang. 5 This method may be described 
heuristically as follows: A notion of equilibrium is defined by the prop- 
erty that the rate of flow of probability into (or onto) a state equals that 
out of (or from) the state; this equilibrium is expressed in a set of equa- 
tions among the state probabilities, the so-called statistical equilibrium 
equations; the "equilibrium" state probabilities are then taken to be 
(or defined by) the solution of these equations. 

The method of statistical equilibrium can be interpreted in the mathe- 
matically rigorous context of semigroups of positive operators, here the 
matrices of transition probabilities \Q(l), t real} for a Markov process 
x t taking values in S, with 

Q(t) = (<u„(t)) 

q iy (t) = Pr {state of system is y at / if it was x at j . 

The generator A of the semigroup is the matrix of transition rales or the 
derivative 

A =lim A(t) - 1 =Q'(0). 

The matrix .4 expresses the relative probabilities of the various changes 

* We are using the convention (Aq) x = 2l a uz q u , rather than the more usual 

v 

(Aq), = 5Z a Iy q u . 

ii 
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that can take place in a small amount of time, and indeed 

Q(t) = I + At + o(t) as t^O. 

In terms of the generator A the equation of statistical equilibrium can 
be written as Aq = 0, which expresses the fact that the vector q of 
state probabilities is an eigenvector of A corresponding to a zero eigen- 
value of A . From the semigroup property 

Q(t + s) = Q(t)Q(s) 

it follows that 

Q{t) = exp At 
L QAvxit) = q* , * e S, t real 

;/ 

the last equation expressing the invariance of q under the transition 
probability matrices Q(-). 

V. THE MAXIMUM ENTROPY PRINCIPLE 

In the method of statistical equilibrium, the state probabilities are 
calculated a posteriori from a prior equation expressing an equilibrium or 
invariance principle. This equation is either postulated or is derived 
from assumptions that lead to a Markov stochastic process as a model for 
the operating network. 

In the present work we use a variational principle rather than an equi- 
librium principle as a basis for calculating "equilibrium" state proba- 
bilities. In drawing this distinction we refer only to the immediate form 
of the assumptions and derivations, and imply no absolute distinction, 
since an "equilibrium" principle can almost always be given a "varia- 
tional" form. For example, if A is a transition rate matrix for an ergodic 
Markov process, and A is self-adjoint with respect to an inner product 
(•,•), then the "equilibrium" probability vector q, i.e., the solution of 
Aq = is equally well described as the vector which maximizes the 
Rayleigh quotient 

(Aq,q) 
(?,?) 

It will turn out that the probabilities [q x , x e S] derived from our 
variational principle also have an invariance property expressible, as in 
the example given, in terms of the self-adjoint generator A of a Markov 
semigroup by the equation Aq = 0. This equation can be interpreted 



THEORY OF TRAFFIC 577 

as a "statistical equilibrium" equation, and the elements of A related 
to calling-rates and hangup rates, in the various states x e S. 

However, instead of starting with a suitable matrix A to represent 
the infinitesimal dynamic behavior, and solving Aq = in order to 
obtain an equilibrium distribution \q x , x e S} over the states of the 
system, we shall directly choose a certain q, to be used as an "equi- 
librium" distribution for calculating quantities of interest, according 
to this criterion: The entropy functional 

II (q) = -2 9' log ( U 

xts 

is to be as large as possible subject to the conditions 

q* ^ 0, X e S 

S fr = i, 

xeS 

y £,\x\q x = m, 

where m is a given number, the average load carried. The first two condi- 
tions ensure that only bona fide probability distributions are considered, 
while the third enjoins that q give rise to m as the mean number of calls 
in progress in equilibrium. This criterion or method for choosing a prob- 
ability distribution over S we call the maximum entropy principle; it is 
exactly analogous to that used in statistical mechanics, provided that 
the number of calls in progress is interpreted as the energy of the me- 
chanical system. We have already stated that this principle leads to a 
unique q which is exactly the same as would be obtained by a particular 
choice of A , given later, and solving .4*/ = 0; this matrix .4 has a definite 
interpretation in terms of system behavior during small periods of time. 
A measure of justification for using the maximum entropy principle 
can be obtained from five arguments: 

(1) Insofar as a high value of the entropy functional is an indication 
of a low degree of information, so far can use of the principle be inter- 
preted as postulating that an equilibrium distribution \q x , x t S\ corre- 
sponds to a condition of maximum ignorance subject to a given average 
number of calls in progress. The principle may thus be said to represent 
a "safe" or "worst case possible" approach to the problem. Exactly the 
same principle is used in statistical mechanics to obtain the canonical 
distribution. In both cases it is a reasonable and systematic way of 
throwing up our hands. 

(2) The principle is appealing for the obvious reasons of unity, uni- 
formity, simplicity, and elegance. 
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(3) It leads to a theoretical structure similar to that of statistical 
mechanics. As in the physical theory, statistical quantities of interest are 
calculated from a -partition function, characteristic of the network under 
study, that depends on purely combinatory properties of that network. 

(4) The principle of maximum entropy leads to a unified theory ap- 
plicable to all connecting networks. That is, the resulting "equilibrium" 
distribution depends algebraically on the structure of the network in a 
way that in a sense is uniform for all networks. 

(5) The principle can be given a dynamic context in terms of Markov 
processes. This context permits the study of the approach to equilibrium 
in time, with important applications to sampling error. 

VI. THE CANONICAL DISTRIBUTION 

In the next few sections we develop some of the principal consequences 
of the maximum entropy principle, and examine their similarity to 
statistical thermodynamics. In the present section we determine the 
distribution \q x , x e S] which maximizes H(q) for a given average load 
carried. The following lemma is no doubt well-known, especially to 
physicists; since its application in traffic theory is relatively new, its 
detailed proof is included for completeness. 

Lemma 1: Letf(-) be a nonnegalive function defined on S, and let 

*«) = £ 5"". 

xt.1 

The maximum of 

IH<[) = -H 1x\^(lx 
ztS 

subject to the conditions 

q x ^ 0, x e S 

^ q z f(x) = m f , (m/ a given positive number in the closed (2) 
xtS convex hull of the range of /( • ) , ) 

is 

H(q) = log ^(o) — m f log co, 
where co is the unique positive solution of 

co{d/du) log^(w) = m f . 
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The maximum is achieved by the choice 

qt = [*/»/*(«)], 

= exp { —aif(x) — a-2 — 1 ) , x e S 

where a\ , a% are {the values of Lagrange's multipliers) determined by any 
two of the relations 

as = log*(r~ ai ) - 1, 

m f = £/(#) exp (— aif(x) — a 2 — 1), 

xtS 

o) = exp { — a t } . 
Proof: With a^ and a* as Lagrange's multipliers, we form the expression 

h = — X) 9* log (7* - «i 2 /(*)?* — °2 2 (Z*> 

X«5 T«S T«S 

differentiate with respect to each q x , and set the resulting derivatives 
equal to zero. This gives the equations 

dh/dq x = -(log g, + 1 + 7i/<-<-> + oz) = 0, x e S (4) 

whose solution is (8). The multipliers a x and o a are to be determined 
from the conditions 2«* & - * a,u l 5Z«s/(aO& = »'/ • The first gives 

1 = c'^- I £ exp [-«,/(*)], 

XlS 

a, = k>g*(e~ fl| ) - 1, 
while the second yields 

m, = c" 1 "-- 1 Z/U) expl-a,/(a;)] 

E/(.r)exp[-fl 1 /(.r)] 



£ exp[ — ai/(.r)] 

If.S 

Setting a) = exp {— Oi}, it is found that co should he a solution of the 
equation 

u)(d/dtc) Iog¥(w) = m-f > 0. (5) 

From the fact that 

■£ log MK<f°) = -^ E (/(*) + £ log*( C -°)Y c- a/w 
da 2 *(e a ) xTs \ «o / 

> 
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it is easily shown (Khinchin, 6 p. 77) that there is exactly one solution 
oj of (5), and that co is positive. 

A relative extremum of H( q) in q ^ subject to (1) and (2) must 
satisfy equations (4). Since these have only one solution there is only 
one such extremum. To show that it is a maximum it is enough to 
show that the matrix of second derivatives of H(q) with respect to the 
components q a of q is negative definite. However, this is straightforward, 
since 

if x 9^ y 
d-II 

dq x dq u 

In Lemma 1 we let 

/(*) =\x\ 

= number of calls in progress in state x 

and we obtain 

Theorem I: Lei m > 0; let 

IfS 

and let X be the unique (positive) root of 

m = \(d/d\) Iog*(X). 

The maximum of H(q) = -£«« <1* log q* , subject to the conditions 
that q be a ■probability vector over S and that z2 xt s \ x \ q x = m, is 

//max = l0g*(X) - mlogX 

and is achieved by the vector q with components 

This is the distribution of probability over S that is determined 
uniquely by the maximum entropy principle; as noted before, it is the 
canonical distribution. The function <!>(•) is called the partition function 
of the connecting network whose states form the set £. Since m deter- 
mines X uniquely and vice versa, we can use X as the parameter that de- 
termines the average traffic level instead of m. Indeed, m is a monotone 
increasing the function of X ^ 0. Also it can be seen that moments of 
the distribution of the number of calls in progress (other than the mean) 
can be calculated from <£>(•) by logarithmic differentiation. 
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VII. PROPERTIES OF THE PARTITION' FUNCTION 

In this section we exhibit various identities and relationships that are 
typical of the partition function <£(•). This function is the generating 
function of the number of states in a given level; that is 



*(*) = E **!■&* I. 



iv = max | x \, 

xiS 



Thus the problem of calculating X, <p( • ), and q in our model reduces 
to the calculation of the sequence 



J A) L \ L\ 



and vice versa. 
Remark 1 : 



\i*\=l E UJ 

K ytLi e _ l 

EU«I = EU-I = *'(D 

XfS XtS 

The first part of this result was proven as Theorem 1 in Ref. 2, and it. 
implies the second part. 

The Mobius function /x( ■ ) of the partially ordered system (»S, ^ ) is 
defined recursively by 

M (0) = 1, M (.c) - -Em(//) if •<• > 0, XeS. 

We have remarked in previous work (Ref. 2, .Section VII) that if S is 
a class of network states, then n( ■ ) takes on the simple form 

M (.r) = (-1)" 1 |.r | !. 

We define the generating function .l/( • ) by 

M(S) = E£ |J V.r). 

xtS 

Since 

x,s x,s I x | ! 

it can be seen that (except for a change of sign in the generating variable ) 
*( • ) is the exponential generating function associated with il/( • ). Thus 
we have 
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Remark 2 : 

M(Z) = f e~"<t>(-Zu)du 

• o 

Proof: 

M($) - £?\L k \-(-l) h k\ 

A-=n 

= e(-«*ii»i r Vvdu 

*=o ■ o 

= / e - "<£>( — £u) du. 
•'o 

In analogy with Birkhoff, 7 p. 15, (12), we define for each x e S a 
characteristic polynomial by the recursion formula* 

P*U) = s |xl - 1^)- 

This is related to the Mobius function /*( ■ ) by the fact that if n u ( ■ ) 
denotes the Mobius function for the set \x: x ^ y\, then 

p.(«) = E*"W*). 

However, the partial ordering of the cone j.c x ^ /yj is again of the 
same form as that of S; i.e., there are exactly (| x — y \)\ ascending 
chains between // and x, all of length | x - y |. Hence, by ref. 7, p. 15, 
(ID, 

(-l) |j ~ tf l 

(|x-y|)i 



M„U) = 71 TTTI = m(* - V, 



and 



p a (€) = E« w j»(*-v) 

= t ] + E € w m(* - »)■ 



!/<* 



Let now 



«.(€) = E $"" 



* Actually, BirkholT's polynomial p,[f| equals fio,(£). The definition we use is 
more convenient for our purposes. 
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The Mdbius inversion formula gives 

t ] = ZmU - y)q y (Z) 

V<X 

VM) = Em(s-2/)P„U). 

To calculate q x ( ■ ) explicitly, we note that if 2s k ^ | x |, then, using 
the cup H for set intersection, 

|L,n \y:y£x] \ = Ofy, 

i.e., there are exactly ( ' J states with /,• calls up below any state x. 
Hence 

Let us write 

where /•„( • ) are functions to be determined. Using the Mdbius inversion 
formula once more, we find that one choice of the r z 's is 

V<-r 



so that 



= P,(€) - g.(t) 



and 



= 2* I -nu 

It is apparent that 

•tt + t) = Zd + ^. M = Ezf'f'V. 

xiS xtS A=0 \ '' / 



mber of elements of L k less than or equal to x. 

A=0 
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Since for ^ k ^ I x I there are precisely 



:') 



elements in La- that are below x, we have 

*(1 + *) = E P»(f). 

XtS 

The preceding results yield the following identities for *( • ) : 

<p(x) = E x w -EEm(j!- /yk,(A) 

rti! Xi8 ll<x 

= EEr„(X) 

J-t.--' //<-r 

= LEW -x' 1 ") 
= E *>«(*) - E«*00 

rr«.S 3-tS 



VIII. A REVERSIBLE MARKOV PROCESS FOR WHICH THE CANONICAL DIS- 
TRIBUTION IS INVARIANT 

We shall describe an orgodic reversible Markov process z t , taking 
values in the set S of states, and having the property that its stationary 
distribution over 8 is precisely the canonical distribution derived from 
the maximum entropy postulate. This Markov stochastic process can 
be used to place the canonical distribution into a dynamic context by 
exhibiting it as invariant under a semigroup of positive operators, viz., 
the transition matrices of the Markov process in question. The transi- 
tion rate matrix A of this process, i.e., the generator of the semigroup, 
then provides several interpretations (cf. Section X) of this dynamic 
context in terms of behavior at the terminals of the networks, i.e., in 
terms of calling rates and mean holding-times. 

Let x cSbea possible state of the network. In Section II we have 
introduced the sets of states A x and B x with 

.4 X = set of states immediately above x, i.e., accessible from x by 
adding a new call, 

B x = set of states immediately below x, i.e., accessible from x by a 
hangup. 

The process z t to be considered can he described heuristically by 
saying that if z t = x then z>.) is moving to each y e A x at a rate X > 0. 
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to each y e B x at a rate unity, and to any other state at a rate zero. Its 
transition rate matrix A = (a xy ) is given by 



a*v 






- | x | - X | A x 

I 



if 


V = •'" 


if 


y*B x 


if 


y « -lx 


if 


y *A X U B x 



X 

it' // < A, U /?, and y ^ .r 

With this matrix we can define a Markov process z t in the usual way. 
(Cf. Doob. 8 ) A discussion and critique of possible physical interpre- 
tations of the rate matrix A is given in Section X. 

The probabilistic interpretation of the rate matrix A is that if z, = x 
there is a conditional probability \h + o(h) that z lv u = y, for y e A T ; 
there is a conditional probability h + o{h) that 2*+* = //, for // e #, ; 
there is a conditional probability 1 — \h \ A x \ — h \ x | — o(h) that 
2,+/, = .r; all other events have a conditional probability o(h), as h — >■■ 0. 
The constant X is the calling rate per idle path. 

An alternative informal description of the Markov process z t is as 
follows: the length of time spent in any state x is a random variable 
independent of all other lengths of time spent in a state, having a 
negative exponential distribution with a mean 

1 



x\ + \\ A, 



At the end of a stay in x, a new state is chosen (independently of every- 
thing except x) from A x U B x according to the probabilities 

for elements of A z 
for elements of B x . 



x | + X \A X 

1 
x I + X I A t 



The equation Aq = is the matrix-vector form of the statistical equi- 
librium equations for the process z t . These equations can be written out 
and solved explicitly, as follows: Aq = is equivalent to 

J | .r | + X | A x \}q, = 2 to + X 2 Qv » -i ; * s - (6) 

We find by substitution that q x taken proportional to X M gives a solu- 
tion. Hence the unique normalized (to be a probability vector) solution 
is 

X M X |x| 
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This is precisely the canonical distribution of probability over S which 
was obtained earlier from the maximum entropy principle. Thus, one 
sense in which the canonical distribution is an equilibrium distribution 
is that it is invariant under the transition probability matrices of z t . 

It will be noticed that the vector q has components which satisfy not 
only the statistical equilibrium equation (6) for z t , but also the much 
stronger condition 

q*a*u = c lu n u* X >V e ' s '> 

which is an analog of the principle of detailed balance. In the language 
of probability, this condition is that of reversibility; that is, it is equiva- 
lent to the condition that the process z t look the same whether seen 
forward or backward in the sense that for any two times t and s 

Pr [z t = x and z s = y\ = Pr [z t = V and z s = x\. 

The reversibility of z t has important statistical consequences, explored 
in Sections XII-XIV. However, an immediate consequence is the 
following form of the Boltzmann //-theorem for z t : 
Remark: Let 

H x (t) = -E qxy(t) log q xy {t), 

ytS 

where 

q IV (t) = Pr {z t = y | z = x\. 

Then 

(d/dt)H x (t) ^ <>• 

The proof of this is well-known, being just Pauli's proof of the 
quantum-mechanical //-theorem from the principle of detailed balance. 
(See Tolman, 9 p. 404.) 

IX. ANALOGY WITH STATISTICAL MECHANICS 

As its name suggests, the canonical distribution of probability over S, 
implied by the maximum entropy principle, resembles the canonical 
ensemble of statistical mehcanics and thermodynamics. This analogy 
extends to several other concepts arising either in traffic theory or in 
statistical mechanics, and will now be described. It is assumed that the 
reader is familiar with the rudiments of statistical mechanics; a lucid 
account can be found in Khinchin. 6 

Let us consider a conservative mechanical system embedded in a heat 
bath, and assume that it is described by a canonical ensemble. It can 
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excliange energy with its surroundings; its energy is a randomly varying 
quantity. The basic identification we make is of the number of calls in 
progress in a connecting network with the energy of this mechanical 
system. In other words, new calls in the operating network are analogous 
to increments of energy in the mechanical system, while hangups repre- 
sent decrements of energy. The average energy is identified with the aver- 
age load carried by the network. 

The surfaces of constant energy in the phase-space of the mechanical 
system are analogous to the levels Lk , i.e., the sets consisting of the 
various states with k calls in progress for A - = 0, 1, 2, • • • . The number 
| L k | of ways of putting up k calls, on which our theory rests, is the 
analog of the area of a surface of constant energy. Just as the canonical 
density function is constant over the surfaces of constant energy and 
maximizes the entropy for a given average energy, so is the canonical 
probability vector q constant over each L k and maximizes II (q). 

The partition function of statistical mechanics is defined (cf. lief. 6, 
p. 79) by 



Z(a) = f e-"'" r) dV(x), 



where F is the phase-space, .e e T is a typical state, H(x) is the total 
energy of state x (here given by the Ilamiltonian function), and dV is 
the volume element of phase-space. In a similar way, the partition 
function <!>( • ) is the generating function of the numbers | L k |, k = 
0, 1 , 2, • • • . The set S of states corresponds to the phase-space T, 
H(x) is analogous to | x |, the volume measure on T is analogous to the 
counting measure on »S', and e " replaces £. 

In Khinchin's development 6 of statistical mechanics the temperature 
is defined as inversely proportional to the unique root 6 of the equation 

(d/dd) log Z(d) = average energy. 

Specifically, the absolute temperature T is given by 

6 = (kT)-\ 

where /; is Boltzmann's constant. In our model for a connecting net- 
work the analog (modulo a logarithmic transformation) of 6 is the 
solution X of 

(d/d\) log3>(X) = average load carried. 

Thus it is tempting to identify (log X) -1 as proportional to the "tem- 
perature" of the traffic system. 

The matrix A, introduced in Section VIII as the "transition rate" 
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matrix for the process z t , provides a sense in which the canonical 
distribution q is of "equilibrium" type. The reversibility of z t is analogous 
to the detailed balance property of transition matrices in statistical 
mechanics. (Cf. Tolman, 9 pp. 165 and 521.) This property also implies 
that a form of the Boltzmann i/-theorem is valid for z t , as we saw in 
Section VIII. 

The analogies between our thermodynamic model of traffic and 
statistical mechanics can be collected in the following tabulation : 



STATISTICAL MECHANICS 

Energy 
Partition function 

Entropy 

Temperature 

Area of surface of given 

energy 
Transition rate matrix 
Detailed balance 
Equilibrium 
Heat bath 
Phase space V 
Volume measure on V 



TRAFFIC THEORY 

Calls in progress 

Generating function #( - ) of number of 
ways of putting up k calls, ^ k ^ w 



- 2 q x log q> 



{log (calling rate per idle path)} -1 

| J JI; | = number of ways of putting up k 

calls 
.4 

Reversibility of z t 
Aq = 

Idle customers' needs 
Set S of possible states 
Counting measure on S 



X. DISCUSSION AND CRITIQUE 

It is now reasonable to consider possible physical interpretations of 
the stochastic process z, and of the transition rate matrix A in terms of 
items describing behavior at the inlets and outlets of the connecting 
network, such as calling rates, holding-times, and routing rules. Obvi- 
ously, transitions of z t from a state x into B x represent hangups, while 
transitions from x into A x represent new calls; the entries of A indicate 
the "rates" at which these events occur in the different states. However, 
the reasonableness, and so the acceptability, of z t as a model for traffic 
depends on the interpretations of z t and .4 in physical terms. Hence we 
must inquire whether (and how) the rates entered in A can be viewed as 
realistically describing the terminations of calls in progress, the occur- 
rence of new calls between inlets and outlets of the network, and their 
routing or disposition. 

In general, to construct a Markov process as a model for traffic in a 
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connecting network whose states form the set «S', it is usually sufficient to 
give, for each state x t S, and each inlet u and outlet /', 

[i) the hangup rate for the various calls in progress in x, 
(ii) the calling rate between u and v in state x, 

(in) the method for disposing of requests that encounter congestion, 

receive busy tone, etc., 
(iv) the method for choosing routes of new calls. 
A particular choice of the items (i)-(ir) leads to a transition rate matrix, 
and so to a Markov process. We shall assess the reasonableness of z, as 
a model for traffic in terms of items (i)-(iv) above by exhibiting two 
choices of (i) (ir) that both lead to the rate matrix .4 of z, . 

In the dynamic model z, described in Section VIII, the role of the in- 
lets and outlets is open to (at least) two different interpretations, each 
of which induces a corresponding interpretation of the transition rate 
matrix A . 

One possible interpretation of the inlets and outlets is to take them 
seriously as actual terminals or customers' lines. They are then the outer- 
most portions of the network under study, the original sources for traffic 
that enters the system, beyond which there is no more connecting or 
switching equipment. From any inlet, or to any outlet, there can be at 
most one call in progress. In this case the rate matrix .1 can be inter- 
preted as saying that in a state x each call in progress is terminating at 
a unit rate, that the calling rate from an idle inlet u to an idle outlet v is 

X- number of available paths from // to /■ in state .<• 

= X- number of states covering x which include a (u,v) call, 

and that of the possible routes for a new call one is chosen at random 
(equal probability for all). The reader can verify that this choice of 
(i)-(iv) does in fact lead to the rate matrix .4. Note that this description 
does not provide for the generation of blocked calls. 

The choice of a unit hangup rate per call in progress is tantamount to 
measuring time in units of mean holding-time, with the convenience that 
carried and offered loads come out in the standard units of erlangs. This 
unit hangup rate can be obtained as a consequence of assuming that the 
holding-times arc negative-exponentially distributed with mean unity, 
mutually independent, and independent of the random process describ- 
ing new calls. This assumption of "negative exponential holding-times" 
is a standard one in congestion theory. (See e.g., Syski, 10 p. 9.) 

More interesting (and questionable!) is the fact that under this inter- 
pretation the calling rate in a state x between an idle inlet it and an idle 
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outlet v depends on the number of ways in which a call from u to v 
could be put into the network in state x. This calling rate can therefore 
change in time as the state changes, even if u and a remain idle. It can 
be argued that this is an unrealistic feature, and that therefore z, is not 
a wholly reasonable model for telephone traffic in a network whose in- 
lets and outlets are interpreted as terminals or customers' lines. For 
surely the idle calling parties do not know the state of the system, nor 
the number of paths available for a call between them, and so they can- 
not (let alone do not) adjust their calling rates accordingly. 

In a sense, it would be more intuitive and reasonable to assign a call- 
ing rate A to each idle pair (u,v) of terminals (an inlet u and an outlet v) 
irrespective of the state x of the system. This basic calling rate for each 
idle pair (u,v) is then distributed over the states that cover x and realize 
(u,v) [assuming that (u,v) is not blocked, so that there arc such states] 
in accordance with some routing rule. A stochastic process x t on S based 
on this idea is described in Section XI, and is studied in detail in a work 
(Benes") to appear later. 

From an a priori viewpoint, x t is a more reasonable model for traffic 
than z t . The objection (described above) to letting the calling rate for 
an idle pair depend on the state is severe. Nevertheless it does not neces- 
sarily destroy the usefulness of the process z t for describing traffic. Three 
comments are relevant here : 

(1) If all calls can be put up in at most one way, then x t and z t coin- 
cide. 

(2) If calls can be put up in only a few ways, it may often be possible 
or useful to regard z t as a small perturbation of x, obtained by raising 
various calling rates. This idea is explored in Section XI. 

(3) Even if z t is not in any precise sense a small perturbation of the 
a priori reasonable model x t , it deserves to be considered as a model of 
traffic. It must not be forgotten that the usefulness of a theory rests 
more on its success in predicting than on its meeting criteria of reason- 
ableness that are adduced a priori. 

However, it is possible to give the inlets and outlets a second interpre- 
tation, different from the one that assigns them the role of "outermost 
terminals." This interpretation makes z, a fairly reasonable model of 
traffic, in the a priori sense we are discussing. It consists in letting each 
inlet or outlet represent a point from which several or many calls can be 
in progress to other points in the system. Physically, such an inlet or out- 
let might be a l'BX or central office serving a locality. As such, it would 
itself contain a connecting network which is left out of account in the 
model. It no longer necessarily makes sense to speak of busy and idle 
inlets, or outlets. 
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To give an intuitive rationale for this interpretation and for the as- 
sumption about calling rates that corresponds to it, let us pick an inlet- 
outlet pair (u,v) and think of u and v as (possibly geographically sepa- 
rated) points between which there may be several calls in progress. For 
example, the network under study might be a toll network, and u and v 
might be local central offices acting as sources of traffic for the toll sys- 
tem. Or, for a second example, u and v might be distinct switches in a 
large network inside a central office. 

In such situations, it is natural to expect that if in a state there are 
many paths available for a call from u to v, then there is a larger proba- 
bility that a requested call from u to v arise in the next small interval of 
time h than if there were very few paths between u and v available. In 
other words, it is reasonable that the calling rate in x for (u,v) calls be 
a monotone increasing function of the number of paths available in x 
for such calls. 

A particularly simple monotone function is the linear one, and we shall 
assume that the calling rate for an idle pair (u,v) in x is 

X- number of paths available in x for (u,v) call, 

and that of the available paths one is chosen at random. Again, no pro- 
vision is made for the generation of blocked attempts, since these will not 
affect the state probabilities when blocked calls are refused. 
We observe that A x can be partitioned and written as 

A x = U A x (u,v), 

(u.v) 



where 



itl. 



A x (u,v) — {y.y covers x and realizes (u,v) 



| A x (u,v) | = number of paths available in x for a (u,v) call. 

Since routes for new calls are chosen at random we find that the transi- 
tion rate from x to y c A x is exactly X, so that this second interpretation 
also leads to the rate matrix A . 

XT. A MARKOV MODEL BASED ON TERMIN r AL-PAIR BEHAVIOR 

We now revert to interpreting inlets and outlets as the ultimate ter- 
minals of the connecting network. In Section X it was suggested that 
under this interpretation an a priori reasonable model (a stochastic 
process x t ) can be obtained by postulating an effective calling rate 
X > per idle inlet-outlet pair. This can be done by assuming that each 
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idle inlet calls an arbitrary outlet at a rate X, and vice versa, with at- 
tempted calls to busy terminals rejected with no change of state. The 
total attempt rate in a state x (excluding calls to busy terminals) is 

f number of idle 
(inlet-outlet pairs in x\ 

If J is the set of inlets, and ft that of outlets, with / and ft disjoint, this 
has the quadratic form 

X(|/| - I*|)(|0| - |*|). 

As before, we assume a unit hangup rate per call in progress, with 
blocked calls rejected. The description of x t can be completed, finally, 
by specifying a method of routing. This we do by introducing a "routing 
matrix" R = (r xy ) with the following properties: Let a; be a state, and 
let n be the partition of A x induced by the equivalence relation ~ of 
"having the same calls up, possibly on different routes"; then 



r,„ ^ 




r xy = unless 


Vi A.* 


2 r xy = 1 for 


Fell. 



We note that 2»«* r *« * s exactly the number s(x) of attempts which 
would be "successful" if they arose in state x, and that n consists of 
exactly the sets A x (u,v) for {(n,v)\ idle and unblocked in x. 

The routing matrix R is to have this interpretation: each time the 
call {(u,v)} is to be completed in state x, a state y is chosen inde- 
pendently from A x (u,v) with probability r xy and the call is routed so 
as to take the system to state y. 

The foregoing assumptions lead to a rate matrix Q for x t defined by 



3*» = \ 



1 

— | x | — \s(x] 

if y e (A x U B x y and y M x. 



if 


V t B x 


if 


y e A x 


if 


y = x 


if 


y e (A 



This matrix is exactly like A except that for y e A x the rate from x to y 
is not X but (the in general smaller quantity) Xr xu , and that the diagonal 
terms are correspondingly increased so as to keep row sums equal to 
zero. For each Fell, r xy for y e Y represents a distribution of the calling 
rate of some idle unblocked pair (u,v) over A x (u,v ) = Y. Indeed A 
results from Q if all the r xy are replaced by unity. The process x t can be 
defined in terms of its rate matrix Q. 
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The assumptions leading to the rate matrix Q and to the process x t 
have much a priori appeal; x t itself is discussed in detail in a forth- 
coming paper 11 already mentioned. Here we shall merely consider 
whether z t may be regarded as a perturbation of x t . Since each process 
is determined by its respective rate matrix, and since we are interested 
mostly in equilibrium behavior, we restrict attention to asking how 
different are the respective equilibrium distributions over S for x t and 
z t . Thus, if p and q are probability row-vectors satisfying Qp = and 
Aq = respectively, how different is p from q? 

To give a precise estimate, we introduce the norms 

ii m ii = m\ m *v i 



= £ 



t'r 



for matrices and vectors, respectively. Since Ap = (A — Q)p and 
Qq = (Q - A)q, we find 

||p- g |l* 2|| ?- A " 
1 - || Q - A || 

The norm of Q — A, in turn, can be seen to be 

II Q - -1 II = 2X E L (1 - r*y) 

xtS y*A x 

= 2X{*'(1) - E»W] 

xtS 

where 

s(.r) = number of pairs that are idle and not blocked in x. 

Letting 

n = max number of ways a call can be realized 

we find | A, | ^ jusO), and hence 

\\Q-A II ^ 2X( M - 1) £s(s) 



Let 



^ 2X( M - 1 )*>'(!). 



«) = max | .r 
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so that*'(l) ^ w | S |, and 

\\Q-A || S 2X(/i - l)w| S\. 
The average contribution (per state) to || Q — A || is then 

L^JZAI <g 2X( M - l)w. 

XII. THE APPROACH TO EQUILIBRIUM 

It is known from the theory of Markov processes that the matrix 
Q(t) = (q zy (t)) of the transition probabilities 

q xv (t) = Pr {z(t + s) = y \ z{s) = x], t^O 

of the process z t satisfies the Kolmogorov equations 

(d/dt)Q(t) = AQ(t) = Q(t)A, Q(0) = I, 

and that the study of the time-dependent (as opposed to the asymptotic, 
or equilibrium) behavior of z t can be carried out in terms of the charac- 
teristic values of A. Knowledge of the transition probabilities is essential, 
for example, in calculating the sampling error incurred in such load 
averages as 

-ZU„I, lf\'i\dt, (7) 

n j=i i •'o 

where t is the interval between successive discrete observations of 
| z t |, and (0,T) is an interval of continuous observation of | z t |. In this 
section we study the manner in which z t approaches equilibrium in 
terms of the two principal characteristic values of A , i.e., that of largest, 
and that of smallest nonzero, magnitude. Applications to estimating the 
covariances of functions of z, , and to studying sampling error for the 
traffic averages in (7), are described in Sections XIII and XIV, re- 
spectively. 

Our study of the approach to equilibrium is based on the observation 
that the matrix A of transition rates for the process z t is symmetrizable, 
i.e., is a self -adjoint operator in a suitably chosen inner-product space 
of finite dimension | S |. The probabilities 

Qx *(A) Mx 

are all strictly nonnegative, and we use their reciprocals \i x as weights 
in defining an inner product, 
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(r,s) = 2 W*x , (8) 

and a norm, 

II « II - (vO*. 
We now remark that for all states x,y from S, 

qyttux = Qxflxy 

or alternatively 

Indeed, this remark is the basis for the solution q given in Section VIII 
for the statistical equilibrium equations (6) of the process z t ; it has 
the important consequence that A is self-adjoint with respect to the 
inner product denned by (8), viz. 

Lemma 2: (Ar,s) = (r, As), for any \ S | -vectors r,s. 

Proof: A is a real matrix, so 

£ £ (VySj-M* =22 r v s x a xv ny 



— 2—i r uPu 2—i &zySx — 2—i r u 2—/ G-xySxHy 



V x 



In a similar way we prove 
Lemma 3: 

(Ar,s) = 4EE a„xQu(sxHx — x u n u )(ix x r x - ii y r y ). 

x y 

Proof: Since the matrix whose elements are a„ x p x is symmetric, we 
have 

(Ar,s) = X 2 QyxfixnSx 

x y 

= lEZ a ux nx(r u s r + I'xSy). 

x y 

Now 

2 2 (lyxQy^x\r x = 

because Aq = 0, and 

2 2 ayxqyiiy\r u = 2 £ o, ux n y s y r u 

x y y x 

= 
because ^ x a^ = 0. This proves the lemma. 
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Theorem 2: The characteristic values of A are real and nonpositive. 
Zero is a simple characteristic value corresponding to the characteristic 
vector q, normalized to unity. 

Proof: The result follows from the known properties of self -adjoint 
transformations. (See Ref. 12, pp. 153-155.) 

The characteristic; values of A will all be of the Rayleigh quotient 
form 

(Af^v) < Q 

(v,v) 

for some vector v; by Lemma 3 this form is nonpositive. The probability 
vector solution q of Aq = is unique so that zero is a simple charac- 
teristic value. Furthermore, if > r max = i\ ^ • • • ^ r\ S \-j = r m in is 
an arrangement of the characteristic values in decreasing order, the 
variational description of the characteristic values (Ref. 12, p. Ill) 
implies that with || v \\ = (v,v), 

?max = ri = max [(Av,v) \ v J_ q, \\ v || = 1} 

rmin = r lS \-i = min \(Av,v) \ \\ v \\ = 1}. 

The alternative notations r max and r min identify the two "dominant" 
characteristic values, and are introduced for later convenience to en- 
hance the symmetry of the theory. 

One can now estimate n from below by substituting suitable trial 
vectors in the Rayleigh quotient. Choosing a vector v with components 



where 



\x\ — m o 

V x = , X € O, 

Pfix 



m = J2 \ x | q x = A — log $(X) 

xt3 Oh 



,■-£(1.1 -«)■*-*(£, + £) log .ft), 

it is easily seen that (q,v) = 0, that || v || = 1, and that 

Uh v) = - 2 ? 2 «■** \—V— — ) 

= -^Z^hjI + xUJ). 

In equilibrium, the average rate of new calls equals the average rate of 
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hangups, as can be verified from the equilibrium equations Aq = 0. 
That is, 

E I V Iff* = x E Qy I A v I. 

1/eS y«S 



and we find 



-2sr,<0, 

fl~ 



a generalization of a result known (Ref. 13, p. 147) for the simple 
busy signal trunk group (classical Erlang model). 

In general, letting /( • ) be any function defined on the set S of states, 
but not identically a constant, we define 

'»/ = E/0')$x 

xtS 

v? = E (/(*) - m f ) 2 q x . 

xiS 

Choosing now a vector o with components 

v _ fix) - m.f 

(T/Hx 

we obtain (by repetition of previous reasoning) 

-^ E Qu (E U(x) - f{y)f + x E [/(*) - fiy)f\ 

L(Jj- „ \xtB z XI Ay J 

as a lower bound for r\ . 

We now define a set of vector-valued functions \c x (t), x e S, I ^ 0} by 
the condition 

Cx V (t) = q zy (t) — q„ , y € S. 

The function c x ( • ) describes the approach to equilibrium from the 
initial state x at time t = 0. 
Theorem 3: For t ^ 

II c*(0) || exp (r mia t) ^ |[ Cr(0 || ^ || c*(0) || exp (r max /). 
Proof: Since q u and gj„(0 are both distributions in y, we have 

(cx(t),q) = E fc»(0 - 9, = 

v 

so that c x (0 _L q. Also 
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j t || c,(0 || 2 = 2 E [q xv (t) - qi }j t q xy {t)^ 

-»(*■»*) 

= 2(c x ,Ac x ) 
since (d/dt)c x = Ac x ; that is for each J/fS, 

at at z 



Hence, || c x || being nonzero, we find 

2r mi „ ^ (d/dt) log || c x || 2 g 2r max 

and Theorem 3 follows by integration. The argument just given is es- 
sentially reproduced from Kramer. 14 

XIII. COVARIANCES OF FUNCTIONS OF Z t 

For the purposes of this section it is convenient to introduce an inner 
product (•,■)', closely related to but different from ( • , • ) of the previous 
section, and defined by 

X tS 

The associated norm is denoted by || r ||' = (r,r)'K The point of the 
"prime" notation is explained by the fact that the transpose A' of A is 
self -adjoint with respect to (•,•)'• 

Remark: Where A' is the transpose of A 

(A'r,sY = (r,A'sY. 

Proof: 
2 E a Tu r y s x q x = J £ a UI s x r y q,j 



1/ x y x 

Let /( • ) be a function defined on S, and define a stochastic process 
/( by the condition 

Theorem 4-' The covariance of ft is given by 
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n=0 ni 

where the vector f is defined by 

fr = f(x) - E /<*)*, 

xeS 

= f(x) - m f . 
Proof: The covariance of f t is 

E E q*Q*v(t)f*fv = (f,Q(t)'fy 

x U 

= (f,(exptA)'fy 

n=o n i 

with Q(t)' denoting the transpose, and not the derivative, of Q(t). The 
covariance of f t is thus the exponential generating function of the series 
of numbers 

(f,A'"fy n = 0, 1,2, ••• . 

These can be calculated with the help of the following results: 
Lemma 4: Let the matrix elements of A " be a IU (n) . Then 



q x a xy {n) = q u ay X {n) . 



Proof: 



q x a xy n = q* E a iui a ui „ 2 ••• a„ n _ iy 

«1 ,-•• ,«n-l 

E a, M *a u .,„, ■ • • a„ H „_ 19y 

Ul,---,U„_l 



= q y a yx n) . 



Lemma 5: Let Q be the diagonal matrix of elements q x , x e S. Then 

(w,A'"w)' = (A"Qw,Qw). 
Proof: 



E^Z a xy n) w u q x = E (Qw)* E M*a*» (B) (Qio)» 
= E (Qm>)« E aJ n) TQwT u Mx ■ 
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From the three preceding results we obtain 
Theorem 5: The covariance of ft is 

Rf(t) = E -. U n Qf,Qf), 

n=0 ">' 

where the vector f is as in Theorem 4, and Q is the diagonal matrix of 
elements q x , x e S. 

It is readily seen that A n , n ^ 1, is again a self -adjoint operator with 
respect to (•,•)> and that its characteristic values are precisely the 
nth powers of those of A. Also, for any vector v and n ^ 

(A n vv)l-° if niS ° dd 

\^ if n is zero or even 

so that by the variational description of characteristic values we have 

r min n , nodd I U" y,y) ^ J*W*, n odd ^ 

r max n , n even J ( v > v ) [r mi „", n even 

provided that v _L q (in the inequalities involving r max ). Returning now 
to the vector Qf of Theorem 5, we find 

\\Qf\\ 2 = LfrVxV. 

X 

= X) (f( x ) ~ m s)% = °? 

reS 

and 

Z) Qx(Qf)dix = Z) ««■/* = °» 

so that Qf _L g. Letting y = Q/ in (10), we obtain 

cr/r min n S (A n Qf,Qf) ^ <r/r max n , n odd, 

*/Vw w ^ (A n Qf,Qf) g <r f r min n , n even. 

Unfortunately, these inequalities do not give useful bounds for the 
covariance R f (-). However, such bounds can be obtained from the 
formula of Theorem 5 in an elegant way by applying the spectral 

theorem to A. 

Theorem 6: Let a x , • • • , a k denote the distinct characteristic values of A, 
and let E iy i = 1, • • •, h, denote the perpendicular projection on the sub- 
space of all solutions Ar = car. Then the covariance R f { • ) of ft is given by 

Rf(t) = E (EMQf)e ait , 

t'=0 
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with 1 ^ k S | S |, Q the diagonal matrix of elements q x , x e S, and f 
given by f x = f(x) — m f . 

Proof: By the .spectral theorem for self -adjoint operators (Ref. 12, 
p. f>fi) we can write 

A = £ a#i 

i=0 

and 

A" = i^oi u B im 

.=o 

We can now calculate with formula (9) of Theorem 5: 

«/(0 = E-,U"Q/,W 

= £(i^Q/,Q/) <?"•'. 

1=0 

This proves Theorem 0. Since we know that zero is among the charac- 
teristic values (indeed, it is a simple one), one of the a's, say a , will 
be zero. We may reasonably expect R/( ■ ) to approach zero for large t; 
hence the constant, i.e., «„ , term of R f ( • ) should be zero. This (tan be 
seen as follows: the subspace associated with zero consists of vectors 
proportional to the equilibrium vector q, because zero is a simple charac- 
teristic value; but we have already verified that q _l_ Qf; hence 

(E r,Qf) = 0, all r. 

Using this we prove the 

Corollary 1: R f (t) ^ for all t, and in fact 

^ cr/V" 1 " 1 " 1 ^ R s (t) ^ ff/V-"" 1 , allt. 

Proof: Since the A\ of Theorem 6 are perpendicular projections, they 
are linear, self-adjoint, and positive in the sense of Ref. 12, p. 140; the 
usual term for positive is nonnegative semidefinite. Hence 

(£>,/•) ^ 

for any vector /•. Since (E r,Qf) = if E is associated with the zero 
characteristic value, the result follows from Theorem G, using 
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E (EiQf,Qf) = || Qf || 2 = *f. 



I 

i-0 
XIV. APPLICATIONS TO SAMPLING ERROR 



Let us suppose that n samples of the process /,( = f(z,)) are observed 
during an interval of equilibrium of z t at intervals t apart, and that the 
normed sum 

rT'Sn = n' 1 E/ iT 

is used as an estimate of #{/<} . We find that 

Var [S n ] = E (» - lil)«/0>). 
/=— n 

where #/( • ) is the covariance of /«. By using the identity 

" 1 — e -2 "" 
E ( w _ \J \)*~* mu = nctnhu - csch~u, 

= v„(m), 

together with Corollary 1 of Section XIII, we find that 

a/'l» B (— irrmta) ^ Var {&,} ^ <r/v„(— irrmax). 

In a similar way, if /< is observed continuously over an interval 
(0,7') of equilibrium of z t and the time average 

M(T) =±f o T f(z t )dt 

is used as an estimate of E[f t }, then 

Var [M(T)\ = 2T~ 2 [ (T - t) R f (t) dt, 

and Corollary 1 gives 

f (T - t) e' nint dt ^ Var [M(T)\ ^ tr/ [ (T - t) e' m "' dt. 

J J o 



2 

'0 



XV. A GENERALIZATION 

As an extension of the maximum problem posed and solved in Section 
V we shall seek functions 

aJt). xeS, h<*t<>h, h<U. 
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such that for each t in [t y ,t 2 ] 



2 | x | g,(0 = m(t) > 

xfS 

/ II(q(t)) dt = maximum. 



In other words, we look for a time-dependent distribution of proba- 
bility over S with prescribed mean values for the function | • | on S, such 
that the integral of the entropy functional over (h ,/ 2 ) is a maximum. 

The Euler equations for this problem assume the trivial form (with 
L\{ • ) and //•_>( • ) as Lagrange's multipliers) : 

(dll/dq x ) - L^t) | x | - Lo(t) = 0, x e 8 

or, writing out the //-derivative, 

log ?,(0 + 1 + Lv(t) \x\ + U(t) = 0, x e S. 

The argument of Lemma 1 following equation (4) shows that g x ( ■ ) is 
given by 



m(0 = . . . = My- log <J>(w) ) 

/u=X(0 • 



where X(" ■ ) is the unique solution of the equation 

E l*|x<o w / d 

*(X(0) V*« 

This solution has the form of the canonical distribution at each time 
point in [/, ,/ 2 ], and Theorem 1 in effect is just the special case of this 
result that arises when m(t) = m. It is apparent that the form of this 
solution does not depend on what interval [^i ,to\ was considered, so we 
may assume that m( ■ ), and hence also X( • ) and q(-), are defined on 
the real axis. 

Let us define the matrix-valued function A(t) by A(t) = (a xv (t)) 
where 

fl if y*B x 

a <t\ - J x w if y* A * 

" { ' ' ' \- \x\ - \{t) I A x \ if x - y 
|0 otherwise. 

In other words A(t) is obtained from the transition rate matrix A or z t 
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by replacing the constant X by the function X( • )• Then for each t 

A(t)g(t) = 
i.e., 

{|x| + X(0 | A x \}q x (t) - E Quit) + X(0 E 9,(0- 

Thus an analog of the statistical equilibrium equation holds at each 
point in time, and in this sense, a system described by [q(t),k ^ t ^ t 2 ] 
may be said to be locally in equilibrium throughout the interval (k ,t 2 ). 
Let us now redefine the process z t to be the time-dependent Markov 
process corresponding to the (time-dependent) transition rate matrix 
A( • ). We know that if X( • ) were a constant function with the particular 
value Mn), then the process z t would have a stationary or equilibrium 
distribution over S given by 

, , [X(tt)] |J| 

We may therefore expect that if X( • ) is not constant, but changes only 
slowly with time, and if z has the absolute distribution (vector) q(0), 
then z t for t > has a distribution approximately given by q(t). Let us 
see in detail how this occurs. 

The transition probability matrix 

Ov(Wi) = Pr ! 2 '2 = v\ z h = &1. 

is now indexed by two time parameters instead of one, because of the 
time-dependence of z t . The forward Kolmogorov equation for Q( ■ , • ) is 

(d/dt)Q(u,t) = Q(u,t)A(t), u < t, 

or 

(d/dt)q xu (u,t) = -[| y I + Mt)s(y)]q xu (u,t) 

+ E 3«(0 + *(0 E a»(0 

with Q(i,<) = f ■ I* i s easily seen that 

Q(tt,0 = exp / Q(u>) du>, 

•'u 

the exponential of a matrix being defined by the usual series in powers 
of the matrix. Therefore if 

Pr {z = x) = g,(0), 
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then 

Pr [z, = x] = £Pr [20= y\qy X (t), 

VtS 

and the absolute distribution of z t is given by the vector 

Q(0,t)q(0). 
We now write 

Q(0,t) = exp tA(t) + jexp j A(u) du - exp lA(t)\ , 



observe that 



/ A(u) du - tA{t) = j udA(u) 



and make this heuristic argument: Since dA/du is small, Q(0,t) is 
approximately exp (A(t); however, for t fixed and u varying 

exp uA(t) 

is the transition probability matrix of a process z t obtained by setting 

\(u) = X(0- 

This matrix approaches, as u — > oo, the matrix each of whose rows is 
q(t). If / is large compared to the time it takes this to happen, we may 
expect, by Theorem 3, that 

exp tA(t) 

have rows all approximately equal to q(t), so that 

distr [z t \ W«(0. 

The informal argument just given can be made precise. For the 
purposes of this last section, we again introduce the vector norm 

11/11 = SI/. I 

xeS 

and the matrix norm 

II M || = El "* I- 

x.ytS 

Also, we use the following result: 
Lemma 6: Let M,N be \ S {-dimensional square matrices, with 
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c = \ sup | m xy \ + \ sup | n xu j 

x.ytS x.ytS 

Then for integers k ^ 1 

|| M k - N k || ^ (2c | S I)*" 1 || M - N ||. 
Proof: There obtains the identity 
M 4 - AT* = \{M + N)(M k - 1 - AT* -1 ) + J(Jf - ADM* -1 + AT*- 1 ). 
If (&«,,) are the elements of £ and (b xu ik) ) are those of B k , then 
sup | & W (W \ ^ \S | -sup | &, w (t_1) | -sup | b xy \. 

Hence, with k > 1 

I ft— 1) , (t— 1) I «-- I O ifc-Z/'O,.^ -1 

sup I m iB + n xy \ %\ b\ (2c) , 
and 

rfc— I I ,rJ-l\ II ^ /o_ I O l\*— ] 



(M - N)(M*- 1 + AT* -1 ) || ^ (2c | S |) -II M - N \\. 



Also, 



Thus 



k-1 nrk-l 



M + N)(M*~* - AT") || ^ c | 5 |-|| AT" 1 - AT 



|| il/ A " - N k || ^ c | S HI il/ fc_1 - AT" 1 || + I || M - AT || (2c | S 

^ || M - N || (2c | SI)*' -1 

• [(tf- 1 +\T.{c\a\) i yo\8\r'\ 

^ || M - N \\(2c\S\) k ~\ 
Using the lemma we find that the norm of 

exp / A{u) du — exp tA(t) 

is at most 

A(u)duJ - t"A r (t) 



< 



b 

e — 



nl 



1 II f 

Jn 



where 



I f 

6 = 2 | 5 | sup / a xy (u) 



rf« 



z,I/ ^0 



/ II "0 

+ t mp\a xu (t) | . 
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It can be verified that 



/ u dA(u) 

J n 



-2E 



u d\(u) 



^ 2*V(1)« sup \\'(u) I. 



e[0,«] 



Thus if \'( • ) is small on [0,/] the distribution of z t is nearly 

(cxp tA(t))q(0) 

(in the sense of the vector norm of this section). By Theorem 3, how- 
ever, this wall be nearly q(t) (in the sense of the norm defined by q(t)) 
if t is large compared to the time it takes exp uA(t) to approach its 
limit asu-* m, 

XVI. ACKNOWLEDGMENTS 

The author expresses his appreciation of the help and encouragement 
given him in the preparation of this study by A. Descloux, J. R. Pierce, 
II. O. Pollak, J. Riordan, and E. Wolman. 



REFERENCES 

1. Bones, V. E., Heuristic Remarks and Mathematical Problems Regarding the 

Theory of Connecting Systems, B. S.T.J. , 41, July, 1962, pp. 1201-1247. 

2. Benes, V. E., Algebraic and Topological Properties of Connecting Networks, 

B. S.T.J. . 41, July, 1062, pp. 1240-1274. 

3. Lee, C. Y., Analysis of Switching Networks, B.S.T.J., 34, November, 1055, 

pp. 1287-1315. * 

4. LeGall, P., Methode de Calcul de L'encombrement dans les Systemcs Tede- 

phoniques Autoniatiques a Marquage, Ann. des Telecom., 12 (1957), pp. 
374-386. 

5. Jensen, A., An Elucidation of Erlang's Statistical Works Through the Theory 

of Stochastic Processes, in The Life and Works of A. K. Erlang, Trans. 
Danish Acad. Sciences, 1948, pp. 23-100. 

6. Khinchin, A. I., Mathematical Foundations of Statistical Mechanics, Dover, 

New York, 1949. 

7. Birkhoff, G., Lattice Theory, Amer. Math. Soc. Colloq. Publ. XXV, rev. ed., 

1948. 

8. Doob, J. L., Stochastic Processes, John Wiley and Sons, New York, 1953. 

9. Tolman, R. C, The Principles of Statistical Mechanics, Oxford Univ. Press, 

London, 1955. 

10. Syski, R., Introduction to Congestion Theory in Telephone Si/stems, Oliver and 

"Boyd, London, 1960. 

11. Benes, Y. E., Markov Processes Representing Traffic in Connecting Networks, 

to appear. 

12. Halmos, P. R., Finite-Dimensional Vector Spaces, second ed., Van Nostrand, 

Princeton, 1958. 

13. Benes, V. E., The Covariance Function of a Simple Trunk Group, with Appli- 

cations to Traffic Measurement, B. S.T.J. , 40, January, 1961, pp. 117-14S. 

14. Kramer, H. P., Svmmetrizable Markov Matrices, Ann. Math. Stat. 30 (1959), 

p. 149. 



